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by  I 

Philip  S.  Yu 

ABSTRACT 

Formulation  of  given  computer  system  or  network  problems  into  abstract 
stochastic  models  is  considered.  Generally  speaking,  model  formulation  is 
an  art.  While  analytic  results  are  clearly  not  powerful  enough  to  provide 
a "cookbook"  approach  to  modelling,  general  methodology  and  difficulties  on 
model  formulation  are  discussed  through  examinations  of  various  computer 
system  and  network  models.  These  models  are  presented  in  a systematic  way 
based  on  the  hierarchical  approach. 


The  work  described  herein  was  supported  in  part  by  the  Ballistic  Missile 
Defense  Systems  Command  under  contract  no.  DASG60-77-C-0073. 


1 INTRODUCTION 


In  this  report , formulation  of  given  computer  system  or  network 
problems  into  abstract  stochastic  models  is  considered.  The  model 
formulation  task  is  very  crucial  and  sometimes  more  difficult  than  model 
solving.  Generally  speaking,  model  formulation  is  an  art.  There  do  not 
exist  systematic  rules  which  can  convert  computer  system  or  network 
problems  into  abstract  stochastic  models.  Furthermore,  the  stochas- 
tic model  for  a given  problem  is  by  no  means  unique . The  same  prob- 
lem can  be  modelled  to  various  levels  of  details  under  various  degrees 
of  complexities.  Since  it  is  impossible  to  include  all  the  details 
of  a problem  into  a single  stochastic  model,  we  have  to  determine  the 
most  crucial  factors  affecting  the  performance  cuid  try  to  incor- 
porate them  into  the  stochastic  model.  If  the  problem  can  be  modelled 
by  a conventional  queueing  model,  there  often  exist  systematic  ways  to 
obtain  an  exact  or  approximate  solution  of  the  model.  Otherwise,  we  can 
only  try  to  attack  the  problem  using  more  general  stochastic  models 
and  ad  hoc  techniques. 

We  will  consider  the  model  formulation  of  typical  computer  system 
or  network  problems  in  fairly  details.  A multiprogrammed  time  shared 
computer  system  nay  consist  of  several  loosely  coupled  computers  sharing 
secondary  storage  devices.  Each  loosely  coupled  computer  may  consist 
of  several  tightly  coupled  CPU's  contending  for  main  memory.  The 
computer  system  itself  may  be  part  of  a computer  communication  network. 
An  important  modelling  methodology  referred  to  as  hierarchical  modelling 
is  introduced  and  used  as  the  basic  foundation  for  systematically 
modelling  various  problems  in  a bottom  up  fashion.  Problems  being 
modelled  consist  of  the  interference  problem  and  software  lockout 
problem  in  a tightly  coupled  multiprocessor  system,  similar  problems 
in  a loosely  coupled  multiprocessor  system,  the  performance  of  a 
multiprogrammed  time  shared  computer  system  and  finally  the  tremsmission 


delay  in  a store  and  forward  computer  communication  network  via  terres- 
trial links.  We  hope  the  reader  can  gain  some  feeling  on  model  formulation 
through  these  models.  There  usually  is  a tradeoff  between  mathe- 
matical tractability  and  fidelity  of  a model.  We  are  just  presenting 
or  compciring  possible  models  of  a given  problem. 
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2.  QUEUEING  NETWORK  MODELS  WITH  PRODUCT  FORM  EQUILIBRIUM  STATE 
POSSIBILITIES 


A job  or  process  in  the  computer  system  usually  will  go  through  a 
sequence  of  service  requests  on  different  devices  at  different  stages 
of  its  lifetime.  To  be  more  precise,  starting  with  a request  for  CPU 
service,  a process  will  later  request  service  from  a paging  device 
when  a page  fault  occurs.  After  service  completion  at  the  paging 
device,  it  will  request  CPU  service  again.  The  same  story  repeats. 

It  may  also  request  service  from  I/O  devices  later  on.  To  capture 
this  kind  of  correlations  among  different  service  devices  in  a 
computer  system,  queueing  network  models  have  been  proven  to  be  very 
effective.  There  is  a broad  class  of  queueing  network  models  whose 
equilibrium  state  probabilities  are  in  product  form.  That  is  to  say 

P(S^,  . . . , ) = G P(S^)  . . . P(S|^) 

where  M is  the  number  of  service  centers  in  the  network.  P(S^)  is  the 
probability  that  the  i-th  queue  is  in  state  and  P(S^,  . . . , 
is  the  probability  that  the  network  is  in  state  (S^^,  . . . , S^) . 

G is  a normalization  constant,  chosen  so  that  the  sum  of  the  proba- 
bilities of  all  network  states  is  unity. 

Queueing  network  models  were  first  introduced  by  Jackson  [42]  where 
the  service  time  at  each  server  has  negative  exponential  distribution. 

A good  survey  on  the  development  of  the  theory  of  queueing  network  models 
up  to  1972  can  be  found  in  Muntz  and  Baskett  [6l].  The  recent  most  note- 
worthy progress  in  extending  the  class  of  analytically  solvable 
queueing  networks  has  been  done  by  Baskett,  Chandy,  Muntz  and  Palacios 
[4].  These  authors  have  succeeded  in  casting  into  a unified  theory  of 
previously  known  but  unconnected  results  such  as  queue  size  distributions 
for  M/M/1  with  FCFS  discipline,  general  service  time  distribution  for 
processor  sharing,  infinite  server  discipline  and  LCFS  preemptive 
resumed  discipline  and  queueing  systems  with  various  classes  of 
customers.  For  ease  of  illustrating  model  formulation  in  later  sections. 


we  summarize  the  characteristics  of  these  network  models  below. 


1.  There  is  no  restriction  on  network  topology. 

2.  There  is  no  restriction  on  the  number  of  service  centers. 

3.  Four  kinds  of  service  centers  are  allowed,  namely 

a.  First  come  first  served  (FCFS) 

b.  Processor  sharing  (PS) 

c.  Infinite  server  (IS) 

d.  Last  come  first  served  (LCFS)  preemptive  resumed. 

4.  Customers  traverse  through  the  network  under  some  fixed 
routing  chains. 

5.  A routing  chain  may  be  open  or  closed  depending  upon  whether 
external  arrivals  and  departures  are  allowed. 

6.  The  external  arrival  process  is  Poisson. 

7.  The  service  time  distribution  at  each  type  of  service  center 


is  as  follows 

a.  FCFS:  exponential  distribution 

b.  PS,  IS,  LCFS:  any  distribution  with  a rational  Laplace 
transform. 

8.  Customers  can  have  different  classes.  Customers  in  different 
classes  can  have  different  routing  chains  and  service  time 
distributions  at  PS,  IS  or  LCFS  service  centers. 

9.  The  service  rate  may  depend  upon  the  number  of  customers  in 
a service  center. 

Reiser  and  Kobayashi  [67]  generalize  the  above  result  to  the  case 
in  which  customer  transitions  are  characterized  by  more  than  one  closed 
Markov  chain.  The  technique  of  generating  function  has  been  applied  to 
obtain  closed  form  solutions.  Kobayashi  and  Reiser  [54]  further  extend 
the  job  routing  behavior  to  high  order  Markov  chains,  i.e.  the 
transition  probability  of  a job  from  one  station  to  the  other  can 
depend  on,  at  least,  the  last  two  stations  it  has  visited  and  not  just 
the  last  one.  In  Lam  [55],  the  class  of  queueing  networks  with  a 
product  form  solution  is  extended  to  include  state  dependent  lost 
arrivals  and  trigger  arrivals.  Such  queueing  network  models  can  be 
used  to  model  store  and  forward  packet  switching  nodes  and  multi- 
programming computer  systems  with  storage  constraint.  Chandy  et  al  [16] 
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introduces  a property  of  queueing  discipline,  station  balance,  which 
seems  to  explain  why  some  service  disciplines  yield  product  form 
solutions  for  queues  and  networks  with  nonexponential  service  time  and 
other  disciplines  do  not.  Previous  results  on  local  balance  are  also 
generalized  to  arbitrary  differentiable  service  distribution  functions. 

In  addition  to  the  stationary  queue  length  distribution  and 
performance  measures  derivable  from  it,  such  as  device  utilization  and 
job  throughput,  distributions  of  response  times  or  passage  times, 
which  are  the  times  required  for  a job  to  traverse  a portion  of  the 
network,  are  other  measures  receiving  a great  amount  of  interest. 
Nevertheless,  analyses  based  on  number-in-queue  s.tate  space  can  only 
yield  the  expected  values  of  passage  times.  Yu  [8]  obtain  passage 
time  distributions  for  the  closed  network  models  in  Baskett  et  al  [4] 
by  transforming  the  passage  time  problem  into  a hitting  time  of  a 
Markov  process.  Efficient  numerical  approximation  and  generalization 
to  open  queueing  networks  are  also  considered  in  Yu  [8] . 
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3.  DIFFUSION  APPROXIMATION  | 


There  still  is  a large  class  of  queueing  systems  which  do  not  fit 
into  the  queueing  network  models  described  in  the  previous  section. 

One  particular  example  will  be  a general  queueing  network  with  first 
come  first  served  discipline  and  general  service  time  distribution 
at  each  service  center.  Recently,  considerable  effort  has  been  made 
for  obtaining  approximate  solutions  to  these  queueing  systems  using 
the  diffusion  approximation  when  the  systems  are  congested.  The 
advantage  of  diffusion  approximation  lies  in  the  fact  that  explicit 
results  can  be  obtained  for  relatively  complex  situations  where  the 
only  possible  alternatives  are  simulation  experiments  cr  sometimes 
numerical  methods  (polynomial  root  finding  methods)  [A].  This  greatly 
extends  our  capability  in  modelling  practical  problems . 

In  order  to  alleviate  the  difficulty  due  to  the  general  service 
time  distribution,  the  diffusion  approximation  replaces  the  discrete 
jump  process  such  as  the  queue  size  process  by  a diffusion  process  which 
is  a continuous  path  stochastic  process.  The  probability  distribution 
of  the  diffusion  process  which  satisfies  a partial  differential  equation 
is  quite  often  more  amenable  to  mathematical  analysis  than  that  of  the 
jump  process.  However,  the  approximation  by  diffusion  process  requires 
the  heavy  traffic  assumption. 

Based  on  central  limit  theorem,  Kingman  [46]  has  shown  in  his 
treatment  of  heavy  traffic  theory  that  the  waiting  time  distribution 
is  as  an  approximation  exponentially  distributed,  where  the  parameter 
depends  only  on  the  mean  and  variance  of  the  Interarrival  time  and 
service  time  distribution,  l.e.,  it  is  insensitive  to  the  detailed  form 
of  the  distribution,  as  the  traffic  Intensity  approaches  1.  The 
diffusion  approximation  based  on  the  same  idea  attempts  to  overcome  the 
limitation  of  the  exponential  model  by  considering  both  the  mean  and 
variance  of  the  service  time  and  Interarrival  time  distributions. 

Newell  [62]  gives  an  extensive  treatment  of  queues  with  time  dependent 
arrival  rate  through  use  of  the  diffusion  approximation  in  his  monograph. 
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Gaver  applies  the  diffusion  approximation  method  to  the  waiting  time  in  a 
M/G/1  queue  [30].  Gaver  and  Shedler  [32] [33]  apply  this  technique  to 
analyze  a multiprograiraned  computer  system  modelled  as  a two  stage 
cyclic  network.  Kobayashl  [50]  considers  the  multi-dimensional 
diffusion  approximation  as  a technique  for  treating  general  queueing 
networks.  Reiser  and  Kobayashi  [67]  study  the  accuracy  of  diffusion 
approximation  techniques  and  propose  a way  to  treat  each  server  in  the 
queueing  networks  separately.  Gleenbe  [34] [35]  suggests  a different  way 
to  handle  the  boundary  condition  of  the  diffusion  process,  namely  using 
Feller's  elementary  return  process  [24].  In  [36],  Gelenbe  also  investi- 
gates the  idea  of  decomposing  a queueing  network  into  separate  single 
servers.  Halachnl  and  Franta  [38] [39]  extend  diffusion  approximation 
to  multiserver  queueing  systems.  Yu  [80]  Investigates  the 
applicability  conditions  of  diffusion  approximation  and  the  decomposa- 
blllty  of  queueing  networks,  which  have  been  neglected  in  the  past,  and 
improves  its  accuracy  by  employing  a new  way  to  estimate  the  diffusion 
parameters.  Generalization  to  servers  with  a queue  dependent  service 
rate  is  also  considered  in  [80].  An  application  of  diffusion 
approximation  to  analyze  the  performance  of  an  ALOHA-like  system  can 
be  found  in  Kobayashi  et  al  [53]. 
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4.  MODEL  FORMULATION 
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In  this  section,  we  summarize  some  basic  guidelines  for  formulating 
a computer  system  and  network  problems  into  queueing  models  below: 

(1)  Identify  service  centers  and  customers  In  a given  problem.  For 
example.  In  a multlprograiamed  computet  system  model,  C?D,  drums  and 
disks  are  service  centers  and  processes  are  customers.  In  the 

cos.- .unicatlon  subnet  of  a computer  communication  netwcrk,  channels  are 
servers  and  messages  are  customers. 

(2)  Identify  the  type  of  each  service  center.  In  the  v'.arkovian  queue tr.g 
network  models  described  by  Baskett,  Chandy,  Muntz  and  Palacios 

[6]  , four  kinds  of  service  centers  are  allowed,  namely,  FCFS,  PS,  IS 
and  LCFS  premptlve  resumed.  In  a multiprogrammed  computer  system  model, 
CPU  is  usually  modelled  as  PS  or  LCFS  premptlve  resumed  type  of  service 
center  where  PS  is  the  limiting  case  of  round  robin  scheduling  and 
LCFS  premptlve  resumed  represents  an  efficient  preemptive  resumed  CPU 
scheduling.  Furthermore,  drums  and  disks  are  usually  modelled  as  FCFS 
type  of  service  centers.  To  be  more  specific,  let  us  take  a closer 
look  on  how  drums  and  disks  are  modelled.  As  a first  order  approxi- 
mation, we  often  model  a multiple  spindle  disk  drive,  which  consists 
of  several  moving  head  disks  connected  to  a single  I/O  channel,  under 
FCFS  spindle  scheduling  [79 1 by  a FCFS  service  center.  Recall  that 
accessing  a record  on  moving  head  disks  requires  three  sequential 
operations,  i.e.,  the  seek  operation,  rotational  positioning  operation 
and  the  data  transfer  operation.  In  the  seek  operation,  the  read/write 
heads  are  mechanically  positioned  to  the  cylinder  containing  the 
record  that  was  requested.  In  the  rotational  positioning  operation,  the 
records  passing  under  the  read/write  head  are  examined  to  find  the  one 
to  be  transferred;  the  disk  is  said  to  be  latent  during  this  operation 
and  the  time  taken  is  latency  time.  Finally,  in  the  data  transfer 
operation,  data  is  transmitted  through  the  1/0  channel  to  or  from  main 
memory.  Each  spindle  of  a multiple  spindle  disk  drive  can  seek  indepen- 
dently of  the  other  spindles.  However,  the  seek  commands,  which  initiate 
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the  seek  operations,  and  the  transfer  conmands,  which  start  the  latency- 
transfer  sequences,  must  pass  through  the  channel  to  the  spindles.  The 
I/O  channel  can  only  pass  a command  or  perform  a rotational  positiordng 
or  data  transfer  operation  for  one  channel  at  a time.  Apparently, 
the  FCFS  service  center  is  only  a very  rough  approximation  of  the  actual 
system.  The  main  problem  on  modelling  comes  from  the  fact  that  three 
operations  are  involved  where  some  of  them  need  dedicated  service  from 
the  I/O  channel  and  some  of  them  only  need  to  be  initiated  by  the  I/O 
channel.  More  sophisticated  models  capturing  this  problem  and  the 
effect  of  scheduling  disciplines  can  be  found  in  Wilhelm  [79]  . For 
paging  drums,  we  often  model  each  sector  of  a drum  as  a FCFS  service 
center  when  shortest-latency-time-first  scheduling  is  used  or  model 
each  drum  as  a FCFS  service  center  when  FCFS  scheduling  is  used.  Note 
that  the  access  of  a record  on  drums  consists  of  a rotational  latency 
and  the  actual  transfer  of  data.  The  latency  can  be  reduced  by 
appropriate  choice  of  scheduling  policy.  To  capture  the  effect  of 
latency  and  scheduling  policy  on  system  performance,  again  more  com- 
plicated stochastic  models  must  he  used  (see  Fuller  and  Baskett  [29]  ). 

In  a computer  communication  network  via  random  access  channel,  the 
server,  i.e.,  the  random  access  channel,  behaves  very  differently  from 
conventional  servers.  For  example,  under  the  ALOHA  scheme,  when  the 
customer  (the  message)  arrives  it  receives  immediate  service  from  the 
server.  If  the  service  time  of  more  than  one  customer  overlaps  in 
time,  the  service  is  in  vain.  All  customers  have  to  be  served  again 
after  a random  delay.  This  is  referred  to  as  the  message  conflict  pro- 
blem. In  this  case,  the  conventional  queueing  network  model  will  not  be 
sufficient  to  model  the  system  performance.  Ad  hoc  techniques  or  more 
sophisticated  stochastic  models  must  be  employed  to  evaluate  the  system 
performance  (see  Kleinrock  [49]  , Yu  [82]  ) 

(3)  Identify  different  customer  classes  in  the  system.  Different 
classes  of  customers  can  have  different  service  time  distributions  at 
IS,  PS  and  LCFS  service  centers,  and  different  routing  probabilities. 
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In  a multiprogranuned  computer  system,  usually  the  CPU  bound  and  I/O 
bound  processes  are  distinguished  as  two  different  classes  of  processes 
or  customers. 

(4)  Determine  whether  the  queueing  model  is  open  or  closed,  l.e., 
whether  external  arrivals  are  allowed.  Closed  queueing  models  will 
be  chosen  if  the  total  number  of  customers  is  fixed,  e.g.,  a multi- 
programmed  computer  system  with  fixed  degree  of  multiprogramming  under 
saturated  load. 

(5)  Determine  the  service  time  distribution  of  each  service  center 
and  the  Interarrival  time  distribution  if  the  queueing  model  is  open 
by  empirical  data  or  hardware  specifications.  Exponential  distribu- 
tion is  often  used  as  a first  order  approximation.  In  this  case,  only 
the  mean  of  the  distribution  needs  to  be  measured.  Kctice  the  mean 
service  time  or  service  rate  may  be  queue  dependent.  The  conventional 
G/M/m  (multiserver)  queueing  system  is  a special  kind  of  service  center 
with  queue  dependent  setrvlce  rate. 

(6)  Determine  the  routing  chain  (hence  the  network  topology)  for  each 
class  of  customers  by  its  characteristic  and  empirical  data. 
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5.  HIERARCHICAL  MODELLING 

Let  us  further  consider  some  difficulties  encountered  in 
modelling  computer  systems  and  networks  • [5l]  The  most  apparent  one  is 

the  multiplicity  of  interacting  parameters  which  are  related  to  activi- 
ties of  different  time  scales  from  nano-  or  microseconds  to  minutes. 

Another  source  of  difficulty  is  that  tasks  or  processors  in  a computer 
system  often  occupy  more  than  one  resource  at  a time.  For  example, 
an  executing  task  holds  not  only  a CPU  but  also  a portion  of  main 
memory.  The  amount  of  main  memory  allocated  to  the  task  clearly  has 
an  important  effect  on  system  performance.  Furthermore,  the  very  same 
subject  can  be  viewed  either  as  a customer  or  as  a server  depending  on 
the  problem  addressed.  A problem  of  this  sort  which  we  will  encounter 
in  later  sections  is  that  in  the  memory  interference  problem  the  roles 
of  processors  are  servers,  but  in  the  software  lockout  problem  they  are 
customers.  Since  a subject  cannot  be  both  server  and  customer  in  the 
same  queueing  model,  these  two  problems  cannot  be  formulated  in  terms  of 
a single  queueing  model. 

The  multiplicity  of  interacting  parameters  or  the  size  of  a computer 
system  or  network  model  can  be  reduced  by  properly  decomposing  the  model  | 

into  submodels  and  evaluating  them  in  a hierarchical  fashion  starting 
from  the  bottom  level  if  the  system  is  nearly  completely  decomposable 
[20].  This  approach  is  referred  to  as  hierarchical  modelling  which  is  ! 

proposed  and  formulated  by  Sekino  [71],  Courtois  [20],  Browne  et  al  [11], 
Brandwajn  [9],  Kobayashi  [51],  Brown  et  al[10].  Furthermore,  the  concept  j 

of  hierarchical  modelling  provides  a partial  solution  to  some  of  the  j 

difficulties  cited  above  besides  size  reduction  of  a model  as  we  shall  | 

see  later. 

A system  is  called  nearly  completely  decomposable  [20],  if  all 
variables  can  somehow  be  clustered  into  a small  number  of  groups  such 
that 

(a)  The  interactions  among  the  variables  of  each  group  may  be 

studies  as  if  Interactions  among  groups  did  not  exist. 
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(b)  Interactions  among  groups  be  studied  with  reference  to  the 

interactions  within  groups. 

Alternatively,  if  Intergroup  dependencies  arc  sufficiently  weak 
as  compared  to  Intragroup  ones,  in  the  short  run  the  system  may  be  con- 
sidered as  a set  of  Independent  subsystems  which  may  be  approximately 
analyzed  separately  from  one  another.  In  the  long  run,  the  whole  system 
appears  to  evolve  keeping  roughly  the  state  of  relative  equilibrium 
within  each  subsystem. 

A basic  requirement  for  near  complete  decomposability  as  observed 
by  Courtois  (20)  is  that  the  intragroup  Interaction  has  transient  time 
coiiJt.i:'.'.  s which  are  much  shorter  than  the  mean  time  between  Intergroup 
interactions.  To  be  more  <-  -'crete,  let  us  consider  how  to  apply  the 
concept  of  near  complete  deconii.  -•’h-tlity  to  computer  system  modelling. 
Since  the  time  scale  of  the  interactxu.  nlays  an  essential  role  in  the 
decomposition,  it  should  not  be  surprise  g that  the  modelling  hierarchy 
and  storage  hierarchy  related.  A diagram  of  a storage  hierarcliy  is  pre- 
sented in  big.  5.1  and  that  of  an  abstract  modelling  iiierarchy  based 
on  that  in  Kobayashi  (51)  is  presented  in  Fig.  5.2.  In  this  representa- 
tion of  hierarchical  modelling,  model  A is  at  the  lowest  level.  The 
time  between  events  being  modelled  in  model  A is  on  the  order  of  a 
microsecond.  The  highest  level  storage  Involved  in  the  storage 
hierarchy  is  main  memory.  An  example  of  a system  at  this  level  will 
be  one  with  tightly  coupled  CPU's,  each  of  which  has  a cache  and 
occasionally  makes  read/vrrite  requests  to  large  main  memory  or  inter- 
leaved memory.  Problems  to  be  analyzed  at  this  level  of  models  «d.ll 
Include  memory  Interference  of  a multiprocessor,  the  effect  of  inter- 
leaving, cache  mapping  algorithm,  etc.  Model  B is  one  level  higher 
than  model  A and  the  time  between  events  in  model  B is  cn  the  order  of 
a millisecond.  The  highest  level  storage  involved  in  the  storage 
hierarchy  is  the  paging  device.  An  example  of  a system  at  this  level 
will  be  one  with  loosely  coupled  CPU's,  each  of  which  has  its  own  main 
memory  and  shares  common  paging  devices.  Problems  such  as  memory 
management  should  be  studied  using  models  at  this  level.  Model  C is 


one  level  higher  than  model  B and  the  time  between  events  is  an  order 
of  magnitude  larger  than  that  under  model  B.  The  highest  level  storage 
involved  in  the  storage  hierarchy  is  the  filing  disk.  The  most  common 
example  at  this  level  is  a multiprogrammed  system  model  where  customers 
are  tasks  currently  in  the  multiprogramming  mix.  If  a loosely  coupled 
multiprocessor  system  communicates  through  filing  disks  Instead  of  paging 
drums f It  should  also  be  modelled  at  this  level.  Problems  such  as  file 
I/O  scheduling  should  be  studied  using  models  at  this  level.  Model  D is 
one  level  above  model  C and  the  time  between  events  in  model  C is  expressed 
in  seconds.  One  example  at  this  level  is  an  interactive  time  sharing 
system  model.  The  highest  level  storage  device  Involved  in  the  storage 
hierarchy  is  the  external  I/O  devices,  such  as  terminals.  Problems  to  be 
modelled  at  this  level  include  terminal  response  time,  job  scheduling 
algorithm,  etc.  Finally,  on  the  top  of  the  level  D model  will  be  the 
model  of  a computer  communication  network. 
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6.  THE  MEMORY  INTERFERENCE  PROBLEM  IN  A TIGHTLY  COUPLED 
MULTIPROCESSOR  SYSTEM 

Starting  from  this  section,  we  examine  various  computer  system  and 
network  models  In  a bottom  up  fashion  according  to  their  levels  in  the 
modelling  hierarchy  cited  In  the  previous  section.  A tightly  coupled 
multiprocessor  system  Is  first  examined.  In  this  system,  the  main  store 
which  is  usually  Interleaved  is  shared  by  all  processors . There  are 

two  important  performance  problems  receiving  a great  deal  of  Interest,  j 

namely  the  memory  Interference  problem  and  the  software  lockout  i 

problem.  In  this  section,  we  consider  the  memory  interference  problem. 

i 

Since  any  of  the  processors  may  access  any  of  the  memory  nodules,  when 

t 

more  than  one  processor  try  to  access  the  same  memory  module,  only  one 
of  them  can  get  access  to  the  memory  module  and  the  other  processors  will  I 

be  queued  up.  This  memory  contention  has  the  effect  of  stretching  the 
average  memory  access  time. 

Performance  analysis  of  memory  interference  has  been  studied  by 
Skinner  and  Asher  (7A]  , Bhandarkar  (S)  , Chow  [18]  , Kobayashl  [51] 
and  others.  Nevertheless,  when  each  processor  has  a private  cache,  the 

memory  interference  problem  becomes  more  complicated  due  to  the  ^ 

i 

difficulty  in  maintaining  data  Integrity.  Tang  [75]  has  recently 
proposed  a cache  system  design  of  a tightly  coupled  multiprocessor 
system.  We  first  briefly  summarize  the  design  of  this  system  and  then 
demonstrate  how  to  convert  the  design  specifications  into  an  abstract 
queueing  model.  The  multiprocessor  system  is  displayed  in  Fig.  6.1. 

The  effect  of  I/O  channels  will  be  neglected  in  the  modelling  of  memory 
Interference  as  usual.  Each  processor  has  a private  cache  in  this 
configuration.  In  a typical  uniprocessor  cache  design,  a directory  is 
used  to  translate  main  store  addresses  into  cache  locations  for  data 
blocks  in  the  cache.  In  a multiprocessor  system,  the  same  cache 
organization  can  be  used  for  each  processor,  vjith  additional  controls  to 
facilitate  cotmunicatlons  among  caches.  The  "store  only  in  cache" 
algorithm  is  adopted  in  this  design.  It  means  that  if  a processor  wants 
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to  write  and  a miss  occurs  in  the  cache,  the  data  is  always  brought  to 
the  cache  so  that  the  processor  can  always  write  to  the  cache. 

A "line"  is  blocks  that  are  associated  with  a given  main  store  » 

address.  In  a uniprocessor  cache,  a line  is  a block,  whereas  in  a 
multiprocessor  system,  a line  can  exist  in  more  than  one  cache.  A lice 
can  be  either  a shared  line  or  a private  line.  A shared  line  is  a line 
which  exists  at  caches  and  has  not  yet  been  modified  with  respect  to 
its  copy  at  the  backing  store  by  a processor.  A private  line  is  a line 
which  exists  in  a cache  and  has  been  modified  with  respect  to  its  copy  i 

at  the  backing  store  or  is  going  to  be  modified  by  its  corresponding  t 

processor.  A private  line  should  exist  in  only  one  cache  so  that  at  1 

any  moment,  throughout  the  system,  only  one  version  of  data  exists  for 
any  address.  To  control  communications  among  caches,  status  bits  would 
have  to  be  used  in  each  entry  of  the  cache  directory  to  identify  whether 
it  is  shared  or  private.  Furthermore,  a "store  controller"  is  necessary. 

The  store  controller  uses  a central  directory  to  keep  track  of  the  i 

status  of  every  line  in  each  cache,  and  makes  sure  only  one  version  of 

data  exists.  At  the  occurrence  of  a cache  fault,  either  due  to  a read 

or  a write,  or  a write  into  a shared  line  In  a cache,  a request  is  sent 

from  the  corresponding  cache  to  the  store  controller  to  take  appropriate 

actions,  e.g.  to  bring  in  a missing  line  and  to  set  the  status  of  a line 

to  private,  etc.  If  more  than  one  processor  require  the  intervention 

of  the  store  controller,  only  one  processor  can  be  served,  and  the  others 

will  be  queued  up.  So  the  store  controller  becomes  the  critical  resource 

of  the  system. 

Furthermore,  there  is  another  kind  of  interaction  among  processors 
which  degrades  the  performance.  When  a line  is  granted  the  status 
private  in  one  of  the  caches,  the  store  controller  will  scratch  the 
other  copies  of  the  line  appearing  in  other  caches.  Clearly,  each  copy 
being  scratched  Is  very  likely  still  in  the  working  set  of  the  corres- 
ponding process  and  will  be  requested  to  send  back  to  the  cache  shortly 
afterwards  through  a cache  fault.  Hence  the  interaction  Increases  the 
cache  fault  rate  of  each  processor  and  degrades  the  performance.  Let 
S(n)  be  the  scratched  rate  at  each  cache  v.’hen  there  are  n processors 
concurrently  in  execution,  i.e.,  not  pending  for  service  at  the  central 
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controller.  It  is  apparent  that  as  the  number  of  active  processors 
Increases,  the  scratched  rate  at  each  cache  will  increase.  It  seems 
quite  reasonable,  as  a first  order  approximation,  to  assume  that 

s(n)  = c (n-1) 

for  some  c,  when  n is  not  too  large. 

The  queueing  model  in  Fig.  6.2a  is  proposed  to  model  the  memory 
interference  problem.  The  store  controller  is  represented  as  a FCFS 
server.  Each  of  the  N CPU's  is  modelled  by  an  independent  server. 

Since  we  do  not  switch  tasks  at  cache  faults,  there  are  exactly  N 
processes  corresponding  to  the  N processors.  That  is  to  say,  there 
will  be  no  queue  in  front  of  any  of  the  processors.  So  we  should  in 
fact  represent  the  processor  service  in  terms  of  an  infinite  server 
station  as  in  Fig.  6.2.b.  Let  n be  the  request  rate  to  the  store 
controller  either  due  to  a cache  fault  or  a change  of  line  status  when 
there  is  only  one  processor  active.  Clearly  p depends  upon  program 
behavior,  cache  size  and  the  line  replacement  policy  of  the  cache.  Let 
d be  the  probability  that  lines  will  be  brought  in  again  shortly 
after  they  are  scratched.  Then  the  service  rate  of  the  infinite  server 
to  each  process  will  be  (y4ds(n))  when  there  are  n processes 
in  the  infinite  server. 

There  are  five  different  types  of  requests  which  can  be  Issued 
from  a cache  to  the  store  controller,  namely  shared  read,  private  read, 
declare  private,  replace  private  line,  and  replace  shared  line.  A shared 
read  or  private  read  request  is  Issued  when  a cache  fault  occurs  due 
to  a read  or  a write, respectively . A declare  private  request  is  Issued 
when  a processor  wants  to  write  into  a shared  line  in  its  cache.  A 
replace  private  line  or  replace  shared  line  request  is  Issued  when  the 
replacement  of  a private  line  or  a shared  line  is  necessary  to  make 
room  in  the  cache  for  the  new  line  coming  from  the  backing  store, 
respectively.  Certain  requests,  such  as  declare  private  and  replace 
shared  line,  do  not  involve  memory  transfers.  A replace  private  line 
request  requires  a write  into  memory.  A shared  read  or  a private  read 
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request  normally  requires  a read  from  memory,  but  if  the  line  requested 
is  a private  line  in  another  cache,  a write  into  memory  is  required. 

Let  be  the  mean  service  time  required  by  the  i-th  type  of  requests 
at  the  store  controller  and  be  its  percentage.  Then  the  mean  service 
time  of  the  store  controller  is 

T = 

i-l 

If  vje  assume  the  service  time  at  each  station  of  the  queueing  network 
to  be  exponentially  distributed,  this  queueing  network  model  will  fall 
into  the  class  of  analytically  tractable  queueing  network  models  cited 
in  Section  2 where  queue  dependent  service  rate  at  each  server  is  allowed. 


7.  THE  INTERFERENCE  PROBLEM  IN  A LOOSELY  COUPLED  ML’LTIPROCESSOR  SYSTEM 


A computer  system  is  referred  to  as  a loosely  coupled  multi- 
processor system  if  the  processors  communicate  through  secondary  stor- 
age devices  instead  of  main  memory.  The  model  for  a tightly  coupled 
multiprocessor  system  is  at  level  A and  that  for  a loosely  coupled 
multiprocessor  system  may  either  be  at  level  B or  level  C depending 
upon  whether  the  processors  communicate  through  paging  drums  or  filing 
disks  according  to  our  modelling  hierarchy  introduced  in  Section  5. 

In  Figure  7.1  we  present  a general  model  for  a loosely  coupled  com- 
puter system.  Each  box  indicated  by  CPU  can  represent  a model  of 
either  a uniprocessor  or  a tightly  coupled  multiprocessor.  In  any 
case,  based  on  the  principal  of  hierarchica''  modelling,  we  can  replace 
each  of  them  by  an  independent  service  station.  The  service  rate  of 
each  CPU  is  obtained  from  the  lower  level  model.  All  secondary  stor- 
age devices  are  represented  by  FCFS  service  stations. 


8.  TIIK  SOFTWARE  LOCKOUT  PROBLEM  TN  A NULTIPKOCI  SSOR  SYSTF.M 


Wt  now  consider  the  software  lockout  problem  in  a multiprocessor 
system.  The  software  locks  are  the  locks  associated  with  data  base 
systems  where  elements  of  the  file  system  are  locked  by  a process  to 
insure  meaningful  processing  by  itself  and  the  other  processes.  The 
lockout  problem  is  a direct  consequence  of  multiple  processor s ' attempt 
to  process  common  data  bases  asynchronously.  The  situation  resembles 
the  memory  interference  discussed  above  except  that  new  the  software 
resources  become  servers  and  the  processors  become  custemers  to  contend 
for  software  resources. 

Let  us  assume  that  locking  is  at  the  page  level  as  in  Welch  (76). 
The  major  difficulty  in  modelling  is  that  when  a processor  wants  to 
access  a shared  page  it  may  either  place  an  eycluslve  lock  or  a 
shared  lock  on  the  page.  If  a process  places  an  exclusive  lock  on  a 
page  then  only  it  can  reference  the  page.  A process  would  exclusively 
lock  a page  if  it  were  going  to  modify  the  contents  of  the  page.  If  a 
process  places  a share  lock  on  a page, then  no  other  processes  can  ex- 
clusively lock  the  page.  A process  would  share  lock  a page  if  it  did 
not  wish  the  page  to  be  changed  during  the  time  it  was  referencing  the 
page.  It  is  usually  assumed  that  the  exclusive  lock  has  higher  prior- 
ity than  the  shared  lock.  Figure  8.1  presents  a diagram  of  the  model. 

To  be  more  specific,  a request  on  a shared  page  will  get  access  to  the 
page  immediately  if  that  page  is  not  exclusively  locked  and  have  to 
wait  until  there  are  no  pending  requests  for  exclusive  lock  otherwise. 

A request  on  exclusive  lock  will  get  immediate  access  to  the  page  only 
if  the  page  is  unlocked  and  have  to  wait  until  the  requests  in  service 
and  all  other  currently  pending  requests  on  exclusive  lock  being  served 
otherwise.  When  a page  is  unlocked,  it  always  chooses,  on  first  come 
first  served  basis,  a request  on  exclusive  lock  to  serve,  if  any. 
Otherwise,  it  will  serve  all  the  requests  on  shared  lock  sirailtaneously , 
if  any. 

The  fact  that  each  shared  page  can  be  put  under  either  exclusive 
lock  or  shared  lock  complicates  the  service  discipline  of  the  service 
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center  and  makes  it  different  from  the  conventional  types  of  service 
centers  cited  in  Section  2.  Various  authors,  e.g.  Madnick  (58|, 
McCredie  [59],  Kobayashi  [51],  Welch  [76]  etc.  have  considered  the 
software  lockout  problem.  Almost  all  the  models  only  allow  one  kind 
of  lock  - exclusive  lock.  Welch  [76]  divided  the  shared  pages  into 
two  categories,  one  for  exclusively  lock  only  and  the  other  for  shared 
lock  only,  and  treated  them  separately  using  M/M/1  and  M/M/«  queueing 
models,  respectively. 

We  now  consider  a rough  stochestic  model  for  the  software  lock- 
out problem  using  a closed  queueing  network  model.  The  shared  pages 
will  be  divided  into  two  categories.  The  first  category  consists  of 
the  pages  which  are  almost  never  exclusively  locked.  The  second  cate- 
gory consists  of  the  rest  of  the  shared  pages  which  are  often  exclu- 
sively locked.  Each  shared  page  in  the  first  category  will  be  modelled 
by  an  infinite  server  station  as  if  they  were  never  locked  and  each 
shared  page  in  the  second  category  will  be  modelled  by  a first  come 
first  served  station  as  if  they  were  always  exclusively  locked.  Al- 
though the  assumption  is  not  quite  true,  it  can  be  considered  as  a 
first  order  approximation  which  leads  to  mathematical  tractability . 
Under  the  above  assumption,  the  closed  queueing  model  in  Figure  8.2  is 
proposed.  Since  there  is  no  queueing  problem  when  a processor  is  not 
referencing  pages  in  the  data  base,  this  stage  is  in  fact  an  infinite 
server  station. 


9.  MUUELS  OF  MULTIPROGRAMMED  TIME  SHARED  COMPl'TER  SYSTEMS 


As  pointed  out  at  the  beginning  of  this  chapter,  one  difficulty 
encountered  in  modelling  computer  system  is  that  tasks  or  processes 
often  occupy  more  than  one  resource  at  a time.  Consider  the  problem 
encountered  in  modelling  a multiprogranimed  paging  computer  system. 

Each  process  executing  a CPU  holds  not  only  the  prime  resource,  i.e., 
CPU,  but  also  main  memory  as  the  secondary  resource.  This  further 
complicates  the  modelling  work. 

A multi programmed  paging  computer  system  consisting  cf  a CPU, 
page  1/0  devices  and  file  I/O  devices  can  be  modelled  by  che  queueing 
network  model  shown  in  Figure  9.1.  The  CPU  station  appearing  in  the 
model  can  also  be  used  to  represent  a tightly  coupled  multiprocessor 
system  as  before.  This  model  can  be  further  extended.  If  k processors 
are  loosely  coupled  through  the  page  l/O  devices,  we  can  replace  the 
CPU  station  by  k Independent  CPU  stations  in  parallel.  If  h processors 
are  loosely  coupled  through  the  file  l/O  devices,  we  can  replicate  the 
dotted  box  h times  and  connect  them  in  parallel. 

In  order  to  capture  the  memory  effect,  let  us  first  consider  gen- 
eral program  behavior.  As  is  well  known  [23],  each  active  process  must 
be  allocated  a sufficient  amount  of  memory  to  contain  its  "working  set" 
in  order  to  avoid  "thrashing,"  i.e.  performance  collapse.  Experiment 
evidence  [ 23  ] indicates  that  the  mean  time  between  page  faults  for  a 
process  executing  in  memory  space  m has  the  general  shape  shown  in 
Figure  9.2.  This  function  is  referred  to  as  the  life  time  function 
L (m)  . (diamberlin.  Fuller  and  Liu  jlA]  pi oposeJ  a two  parameter  fit 
for  the  life  time  function. 


L(m)  = 


2S 


where  d is  the  number  of  pages  that  provides  the  process  with  half  of 
its  largest  possible  life  time  and  S is  the  expected  execution  time 
between  page  faults  when  the  process  is  allocated  memory  space  d. 
Assume  that  total  memory  available  is  of  size  M.  We  can  also  express 
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g Che  life  time  function  in  terms  of  n,  the  number  of  processes  in 

P * 

|(  memory.  Denoting  the  function  by  L (n),  we  get 

j L*(n)  = L(”) 

under  the  assumption  that  memory  is  equally  shared  among  all  programs 
in  memory . 

We  now  return  to  the  queueing  model.  Let  7 be  the  mean  execution 
time  between  file  I/O  requests  of  a process  which  is  independent  of 
the  degree  of  multiprogramming.  Furthermore,  let  and  be  the 
CPU  overheads  on  handling  a page  fault  and  a file  I/O  request  respec- 
tively. These  overheads  can  not  be  neglected  in  general.  The  service 

* _l  _i 

completion  rate  of  the  CPU  can  be  approximated  by  (L  (n')+ti)  +(y+t2) 

l.e.  the  sum  of  the  page  fault  rate  and  the  file  I/O  request  rate.  If 
j the  CPU  is  in  fact  a tightly  coupled  multiprocessor,  we  need  to  pre- 

multiply the  service  completion  rate  by  the  CPU  utilizations  obtained 
by  the  memory  Interference  model  and  the  software  lockout  model  to 
account  for  the  performance  degradation  due  to  interactions  between 
multiple  processors. 

When  a file  I/O  request  occurs,  if  we  retain  the  task  in  main 
memory,  the  number  of  processes  in  memory  will  be  fixed,  so  does  the 
service  rate  of  the  CPU.  Hence,  the  queueing  network  model  will  fit 
into  the  class  of  models  defined  in  Section  2.  However,  a more  real- 
istic case  is  to  swap  the  process  requesting  a file  I/O  operation  out 
of  memory.  In  this  case  the  number  of  processes  in  main  memory  is 
equal  to  the  number  of  processes  in  the  CPU  station  and  page  I/O 
devices.  That  Is  to  say  the  service  completion  rate  of  the  CPU  de- 
pends on  the  total  number  of  processes  in  the  two  stations,  not  on 
the  number  of  processes  in  itself  alone.  This  kind  of  situation  does 
not  fit  into  the  framework  of  the  queueing  network  models  introduced 
in  Section  2.  Nevertheless,  the  problem  can  be  solved  by  applying  hierar- 
! chical  modelling.  Using  the  fact  that  the  mean  time  between  page  faults 

is  usually  an  order  of  magnitude  smaller  than  the  mean  time  between  file 

, . I/O's,  l.e.  the  interaction  between  the  CPU  and  page  I/O  devices  is 

* 

much  stronger  than  their  interactions  to  the  file  I/O  devices,  we  can 
consider  the  CPU  and  page  I/O  devices  separately  from  the  file  I/O 
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devices,  A closed  queueing  network  model,  refered  to  as  CPU  ?GU  model, 
in  Figure  9.3  is  proposed  to  model  the  interactions  between  the  CPU  and 
page  I/O  devices.  For  a given  N,  the  number  of  processes  in  main 
memory,  the  service  completion  rates  of  the  CPU  and  page  I/O  devices 
are  fixed.  So  the  model  fit  into  the  class  of  models  described  in 
Section  2.  We  can  obtain  CPU  utilization  for  various  values  of  N, 
which  will  be  needed  in  the  next  level  model.  Then,  going  up  one  level 
of  the  modelling  hierarchy,  we  can  use  the  closed  queueing  network 
model  in  Figure  9.4  to  model  the  overall  system.  New  the  subsystem 
which  consists  of  the  CPU  and  page  I/O  devices  is  modeled  by  a single 
server  whose  service  completion  rate  depending  upon  the  number  of  pro- 
cesses at  this  service  center  is  equal  to  the  CPU  utilization,  which 
we  obtain  in  the  lower  level  model,  divided  by  the  mean  execution  time 
between  file  I/O  requests  of  a process.  Some  numerical  results  on 
similar  models  can  be  found  in  (9]. 

We  may  further  consider  a queueing  model  for  an  interactive  time 

shared  computer  system.  In  this  system,  we  need  to  model  not  only  the 

multlprogrammed  characteristic  of  the  system,  but  also  the  terminal 

behavior  and  the  job  scheduling  policy  of  the  operating  system.  It  is 

well  known  that  if  we  plot  the  processor  utilization  obtained  by  the 

multlprogrammed  model  in  Figure  9.4,  the  curve  should  take  the  shape 

of  the  dotted  curve  in  Figure  9.5.  [20]  [5]  Note  that  the  dotted 

curve  and  the  solid  curve  coincide  for  N < N . One  of  the  major 

max 

responsibility  of  the  job  scheduling  algorithm  is  to  make  sure  that 
the  memory  is  not  overcommited . That  is  to  say  the  job  scheduling 
routine  will  impose  on  upper  bound  N on  the  number  of  jobs  being 
activated  in  main  memory.  Let  N.p  be  the  number  of  terminals  in  the 
system.  A closed  queueing  model  in  Kobayashi  [51]  is  presented  in 
Figure  9. 6. a.  As  we  can  see  the  multiprograramed  computer  system  model 
appears  as  a submodel  of  the  system.  The  external  "job  scheduling" 
queue  controlled  by  the  scheduling  routine  makes  the  models  deviate 
from  conventional  queueing  model.  Since  the  multlprogrammed  computer 
system  model  is  one  step  lower  in  the  modelling  hierarchy,  we  can 
separate  it  from  the  rest  of  tlie  system  and  model  it  independently  as 
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Fig.  9.5  CPU  UTILIZATION  OF  A MULTIPROGRAMMED  COMPUTER 
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before  except  with  a constraint  on  the  maximum  degree  of  multiprogram- 
ming imposed  by  the  job  scheduling  routine.  The  CPU  utilization  is  now- 
given  by  the  solid  curve  in  Figure  9.5.  [20]  [51]  The  difference 
between  the  solid  curve  and  dotted  curve  in  Figure  9.5  for  N > N max 
is  due  to  the  upper  bound  on  degree  of  multiprogramming.  Now  we  can 
replace  the  multiprogramming  subsystem  model  by  a single  service  sta- 
tion whose  service  completion  rate  is  equal  to  the  CPU  utilization 
indicated  by  the  solid  curve  in  Figure  9.5  divided  by  the  mean  CPU 
execution  time  of  a job.  The  job  scheduling  queue  now  becomes  an 
ordinary  queue  of  this  service  station.  If  the  scheduling  discipline 
used  by  the  scheduling  routine  is  round  robin,  we  should  represent  the 
service  station  by  a processor  sharing  station  which  is  a mathematical 
idealization  of  the  multiprogramming  system  with  time  slicing.  The 
terminals  will  be  represented  by  an  infinite  server  station  since  no 
contention  at  the  terminals  is  assumed.  Hence,  the  simplified  model 
at  the  top  level  will  be  a closed  queueing  network  with  two  service 
stations  as  shown  in  Figure  9.6.b. 
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10.  MODELS  OF  STORE  AND  FORWARD  COMPUTER  COMMUNICATION  NETWORKS  VIA 

TERRESTRIAL  LINKS 

Modelling  of  store  and  forward  computer  communication  networks  via 
terrestrial  links  is  primarily  due  to  Kleinrock  (47,  48).  If  we  view 
channels  as  servers  and  messages  as  customers  or  tasks,  the  problem 
seems  to  be  again  reduced  to  a queueing  network  model.  Actually,  the 
situation  is  not  that  simple.  All  results  on  queueing  network  models 
are  derived  under  the  assumption  that  the  service  time  at  each  server 
is  independent  of  each  other.  In  a computer  communication  network, 
the  randomness  of  service  time  comes  from  the  randomness  of  message 
length  and  the  length  of  a message  is  not  going  to  vary  as  the  message 
hops  through  the  network.  Furthermore,  the  arrival  process  of  messages 
due  to  the  Internal  traffic  in  the  network  is  not  independent  of  their 
service  times.  [48]  Nevertheless,  in  [2],  Kleinrock  investigates  the 
validity  of  the  "message  independence  assumption"  by  extensive  simula- 
tions. The  "message  independence  assumption"  refers  to  the  property 
that  the  performance  of  a network  will  not  change  if  the  message  size 
is  sampled  independently  at  each  node  from  the  underlying  common  mes- 
sage size  distribution  instead  of  keeping  the  message  size  fixed  when 
transmitting  through  the  network.  The  simulation  results  indicate  that 
under  most  conditions  of  interest  this  assumption  is  reasonable.  The 
message  independent  assumption  provides  us  with  the  necessary  founda- 
tion for  applying  the  queueing  network  models  to  analyze  store  and 
forward  computer  communication  networks  via  terrestrial  links. 

Nevertheless,  an  infinite  storage  capacity  at  each  store  and 
forward  node  has  been  Implicitly  assumed  in  the  model.  Lam  (56l  re- 
laxed this  assumption  and  presented  a queueing  network  model  for  each 
store  and  forward  node  to  incorporate  the  constraints  of  finite  nodal 
storage  capacities,  as  wet!  as  the  channel  transmission  control  mech- 
anisms of  positive  acknowledgement  and  time-out  of  packets.  This 
model  is  presented  in  Figure  10.1.  Finally  these  single  node  results 
are  interfaced  by  imposing  a continuity  of  flow  constraint.  Again  we 
see,  depending  upon  the  levels  of  details  required,  queueing  models 
of  different  complexities  may  be  employed. 
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discard 


1 = output  processor  (FCFS) 

2 = time-out  delay  (IS) 

3 = acknowledgement  delay  (IS) 


Fig,  10,1  THE  NODAL  MODEL  OF  A COMPUTER  COMMUNICATION  NETWORK  [56] 


1 1 , REMARK 


While  analytic  results  are  clearly  not  powerful  enough  to  provide 
a "cookbook"  approach  to  performance  analysis  and  prediction,  general 
methodology  and  difficulties  on  model  formulation  are  discussed  through 
examinations  of  various  computer  system  and  network  models.  These 
models  are  presented  in  a systematic  way  based  on  the  powerful  modeilin 
methodology  - hierarchical  modelling. 
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